A New Cosine Series Antialiasing Function and its Application to Aliasing-Free Glottal Source Models for Speech and Singing Synthesis
نویسندگان
چکیده
We formulated and implemented a procedure to generate aliasing-free excitation source signals. It uses a new antialiasing filter in the continuous time domain followed by an IIR digital filter for response equalization. We introduced a cosineseries-based general design procedure for the new antialiasing function. We applied this new procedure to implement the antialiased Fujisaki–Ljungqvist model. We also applied it to revise our previous implementation of the antialiased Fant– Liljencrants model. A combination of these signals and a lattice implementation of the time varying vocal tract model provides a reliable and flexible basis to test fo extractors and source aperiodicity analysis methods. MATLAB implementations of these antialiased excitation source models are available as part of our open source tools for speech science.
منابع مشابه
Implementations of synthesis models for speech and singing
The current implementations of the synthesis models for speech and singing are described. An improved model for speech is presented and compared to the model currently in use. A new singing synthesis model has recently been implemcn~ed in a signal-processing board. The differences between these models are pointed out. Test results from comparative measurements on synthetic speech synthesis arc ...
متن کاملVoice source model for continuous control of pitch period.
The voiced speech waveform may be synthesized by exciting an LPC vocal tract filter with a pulse waveform patterned after naturally occurring glottal airflow pulses. Such a pulse waveform may be generated by computing samples of a piecewise polynomial curve at equally spaced time intervals. In this type of synthesis, the pitch period is commonly restricted to an integer multiple of the sample i...
متن کاملGlottal source modeling for singing voice synthesis
Naturalness of sound quality is essential for singing-voice synthesis. Since 95% of singing is voiced sound (Cook, 1990), the focus of this paper is to improve the naturalness of the vowel tone quality via glottal excitation modeling. We propose to use the LF-model (Fant et al., 1985) for the glottal wave shape in conjunction with pitch-synchronous, amplitude-modulated Gaussian noise, which add...
متن کاملHow to precisely measure the volume velocity transfer function of physical vocal tract models by external excitation
Recently, 3D printing has been increasingly used to create physical models of the vocal tract with geometries obtained from magnetic resonance imaging. These printed models allow measuring the vocal tract transfer function, which is not reliably possible in vivo for the vocal tract of living humans. The transfer functions enable the detailed examination of the acoustic effects of specific artic...
متن کاملVoiced Speech Synthesis Using Pitch Asynchronous Code Excited Linear Filters for the Glottal Source
This paper proposes a model for natural quality voiced speech synthesis using code excited linear all-pole filter for modeling the glottal source signal. Classical glottal signal models are explicit-time functions which inhibit joint sourcetract parameter estimation and require pitch synchronous estimation with precise segmentation of open and closed glottis phase. These problems are overcome i...
متن کامل